Frequent substructure-based approaches for classifying chemical compounds

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Frequent Substructures in Chemical Compounds

The discovery of the relationships between chemical structure and biological function is central to biological science and medicine. In this paper we apply data mining to the problem of predicting chemical carcinogenicity. This toxicology application was launched at IJCAI’97 as a research challenge for artificial intelligence. Our approach to the problem is descriptive rather than based on clas...

متن کامل

Analysing and Classifying Names of Chemical Compounds with CHEMorph

We present a prototypical system with a purely linguistic method to analyse organic chemical compound names. It morpho-semantically analyses compound names, generates line-based, machinereadable representations of their corresponding molecular structures (SMILES strings), and triggers a taxonomic classification. CHEMorph is to be used to support manual database curation and as a basis for bioch...

متن کامل

Binary Substructure Descriptors for Organic Compounds*

Organic chemical structures are represented by binary vectors that contain information about presence or absence of 1365 substructures. The guiding ideas for selecting this set of substructures are described and examples are given. Software SubMat has been developed for a fast and flexible computation of binary substructure descriptors from molecular structures. Examples from structure similari...

متن کامل

Automated Approaches for Classifying Structures

In this paper we study the problem of classifying chemical compound datasets. We present an algorithm that first mines the chemical compound dataset to discover discriminating sub-structures; these discriminating sub-structures are used as features to build a powerful classifier. The advantage of our classification technique is that it requires very little domain knowledge and can easily handle...

متن کامل

Pattern Recognition Approaches for Classifying IP Flows

The assignment of an IP flow to a class, according to the application that generated it, is at the basis of any modern network management platform. However, classification techniques such as the ones based on the analysis of transport layer or application layer information are rapidly becoming ineffective. Moreover, in several network scenarios it is quite unrealistic to assume that all the cla...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Knowledge and Data Engineering

سال: 2005

ISSN: 1041-4347

DOI: 10.1109/tkde.2005.127